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Abstract 



Smoothed analysis of multiobjective 0-1 linear optimization has drawn considerable atten- 
f^ ' tion recently. The number of Pareto-optimal solutions (i.e., solutions with the property that no 

CN i other solution is at least as good in all the coordinates and better in at least one) for multiobjec- 

tive optimization problems is the central object of study. In this paper, we prove several lower 
Xy^ i bounds for the expected number of Pareto optima. Our basic result is a lower bound of fi^ ("■''" ^) 

^^ ' for optimization problems with d objectives and n variables under fairly general conditions on the 

• . distributions of the linear objectives. Our proof relates the problem of lower bounding the num- 

fj ' ber of Pareto optima to results in geometry connected to arrangements of hyperplanes. We use 

our basic result to derive (1) To our knowledge, the first lower bound for natural multiobjective 
optimization problems. We illustrate this for the maximum spanning tree problem with ran- 
►^ . domly chosen edge weights. Our technique is sufficiently flexible to yield such lower bounds for 

^^ ' other standard objective functions studied in this setting (such as, multiobjective shortest path, 

(^ ! TSP tour, matching). (2) Smoothed lower bound of min{17d(n'^-^-^<^('^-'°s'^)(^-®(^/'^)'), 2®(")} 

0^ ' for the 0-1 knapsack problem with d profits for 0-semirandom distributions for a version of the 

knapsack problem. This improves the recent lower bound of Brunsch and Roglin. 



1 Introduction 



Multiobjective optimization involves scenarios where there is more than one objective function to 
^ . optimize: When planning a train trip we may want to choose connections that minimize fare, 

H \ total time, number of train changes, etc. The objectives may conflict with each other and there 

may not be a single best solution to the problem. Such multiobjective optimization problems 
arise in diverse fields ranging from economics to computer science, and have been well-studied. A 
number of approaches exist in the literature to deal with the trade-offs among the objectives in 
such situations: Goal programming, multiobjective approximation algorithms, Pareto-optimality; 
see, e.g., |1HI121[T6] for references. It is the latter approach using Pareto-optimality that concerns 
us in this paper. A Pareto-optimal solution is a solution with the property that no other solution 
is at least as good in all the objectives and better in at least one. Clearly, the set of Pareto-optimal 
solutions (Pareto set in short) contains all desirable solutions as any other solution is strictly worse 
than a solution in the Pareto set. In the worst case, the Pareto set can be exponentially large as 
a function of the input size (see, e.g., [I3]). However, in many cases of interest, the Pareto set 
is typically not too large. Thus, if the Pareto set is small and can be generated efficiently, then 
it can be treated possibly with human assistance to choose among the few alternatives. Pareto 

1 



sets are also used in heuristics for optimization problems (e.g. [H]). To explain why Pareto sets 
are frequently small in practice, multiobjective optimization has recently been studied from the 
view-point of smoothed analysis [19]. We introduce some notation before describing this work. 

Notation. For positive integer n, we denote the set {1,2, ... ,n} by [n]. The multiobjective 
optimization problems we study have binary variables and linear objective functions. In a general 
setting, the feasible solution set is an arbitrary set S C {0, 1}". The problem has d linear objective 

functions v^^^ : 5 — )• M, given by v^'^\x) = X^,gM^^j 2;^, for i € [d], and {v^ ,. . . ,Vn ) S M" (so 

v^^' is also interpreted as an n-dimensional vector in the natural way). For convenience, we will 
assume, unless otherwise specified, that we want to maximize all the objectives, and we will refer 
to the objectives as profits. This entails no loss of generality. Thus the optimization problem is the 
following. 

maximize v^ (x), . . . , maximize v^ {x), (1) 

subject to: x £ S. 

For the special case of the multiobjective 0-1 knapsack problem we have d+ 1 objectives: One 
of the objectives will be the weight w = {wi, . . . , Wn), which should be minimized, and the other d 
objectives will be the profits as before: v^^' = (f} , . . . , Vn ) for i £ [d], and all the entries in w and 
v^^' come from [0, 1]. 

Let V be the dxn matrix with rows v^^' , . . . , v^'^' . We will use the partial order ^ in M'^ defined 
by x ^ y iff for all i € [d] we have Xi < yi. For a, 5 G M we say that b dominates a if 6j > aj 
for all i G [d\, and for at least one i £ [d\, we have strict inequality. We denote the relation of b 
dominating a by 6 ^ a. A solution x £ S \s said to be Pareto-optimal (or maximal under ^) if 
Vx -/< Vy for all y £ S. For the knapsack problem, we need to modify the definition of domination 
appropriately because while we want to maximize the profit, we want to minimize the weight. It 
will be clear from the context which notion is being used. For a set of points X in Euclidean space, 
let p{X) denote the number of Pareto-optima in X. 

Smoothed analysis. For our multiobjective optimization problem ([1]), in the worst case the size 
of the Pareto set can be exponential even for d = 2 (the bicriteria case). Smoothed analysis is 
a framework for analysis of algorithms introduced by Spielman and Teng |19j to explain the fast 
running time of the Simplex algorithm in practice, despite having exponential running time in 
the worst case. Beier and Vocking [3] studied bicriteria 0-1 knapsack problem under smoothed 
analysis. In our context of multiobjective optimization, smoothed analysis would mean that the 
instance (specified by V) is chosen adversarially, but then each entry is independently perturbed 
according to, say, Gaussian noise with small standard deviation. In fact, Beier and Vocking [3] 
introduced a stronger notion of smoothed analysis. In one version of their model, each entry of the 
matrix V is an independent random variable taking values in [0, 1] with the restriction that each has 
probability density function bounded above by (p, for a parameter (j) > 1. We refer to distributions 
supported on [—1, 1] with probability density bounded above by (p as (/)-semirandom distributions. 
For more generality, one of the rows of V could be chosen fully adversarially (deterministically) . As 
(j) is increased, the semirandom model becomes more like the worst case model. With the exception 
of Theorem 11.41 below, we do not require adversarial choice of a row in V. 



Previous work. Beier and Vocking [3] showed that in the above model for the bi-criteria version 
of the 0-1 knapsack problem with adversarial weights the expected number of Pareto optima is 
0((^n^); this was improved to 0{(l)v?) by [1]. Roglin and Teng [16] studied the multiobjective 
optimization problem in the above framework. They showed that the expected size of the Pareto 
set with d objectives is of the form 0{{4>nY ('^+^)). Moitra and O'Donnell [12] improved this 
upper bound to 2 • [Acpd]^ ~^^'''^ ■ n . These authors [16^ [T2] raised the question of lower bound 
on the expected number of Pareto optima. Again, these results allow one of the objectives to be 
chosen adversarially. 

An early average-case lower bound of Q.{it?) was proven in ^ for the knapsack problem with a 
single profit vector. Their result however required an adversarial choice of exponentially increasing 
weights. Recently, Brunsch and Roglin [5] proved lower bounds of the form 

J7d(min{M)('^-l°g2'^)-(i-®(i/'^)),2®W}), 

where ^d means that the constant in the asymptotic notation may depend on d. Unfortunately, 
the instances constructed by them use S that does not seem to correspond to natural optimization 
problems. 

Our results. In this paper we prove lower bounds on the expected number of Pareto optima. 
Our basic result deals with the case when every entry in the matrix V is chosen independently 
from a distribution with density symmetric around the origin. Note that we do not require that 
the distributions be identical: Each entry can have different distribution but we require that the 
distributions have a density. This generality will in fact be useful in the proof of Theorem 1 1.2 1 Note 
also that all entries of V are random unlike the results discussed above where one of the objectives 
is chosen adversarially. This makes our lower bound stronger. 

Theorem 1.1 (Basic theorem). Suppose that each entry of a d x n random matrix V is chosen 
independently according to (not necessarily identical) symmetric distributions with a density. Let 
X denote the random set {Vr : r G {0, 1}""}. Then 

This implies the simpler bound Ey piX) > ( 2(d-\) ) 

We give two proofs of this result. The two proofs have a similar essence, but a somewhat 
different form. Both proofs relate the problem at hand to some well-known results in geometry. 
This connection with geometry is new, and may be useful for future research. The first proof lower 
bounds the expected number of Pareto-optima of a point set by the expected number of vertices of 
its convex hull (up to a constant that depends on d but not on n) and then invokes known lower 
bounds on the expected number of vertices of projections of hypercubes. The second proof gives a 
characterization of maximality in terms of 0-1 vectors and then relaxes integrality to get a relaxed 
dual characterization by means of convex separation, which reduces the counting of Pareto-optima 
to lower bounding the probability that the convex hull of n random points contains the origin. This 
probability is known exactly by a theorem of Wendel. 

Interestingly, our lower bound is basically the same as the expected number of Pareto optima 
when 2"^ uniformly random points are chosen from [—1, 1]'^, which is shown to be Grf(n'^~^) in several 



papers [H [HI [7]. This raises the possibihty of a closer connection between the two models; such a 
connection could be useful as the model of uniformly random points is better understood. 

The basic theorem above corresponds to the case when the set of feasible solutions S is {0, !}"■. 
But in many interesting cases 5 is a strict subset of {0,1}": For example, in the multiobjective 
spanning tree problem n is the number of edges in an underlying network, and S is the set of 
incidence vectors of spanning trees in the network; similarly, for the multiobjective shortest path 
problem S is the set of incidence vectors of s-t paths. We can use our basic theorem to prove 
lower bounds on the size of the Pareto set for such S. Our technique is pliable enough to give 
interesting lower bounds for many standard objective functions used in multiobjective optimization 
(in fact, any standard objective that we tried): Multiobjective shortest path, TSP tour, matching, 
arborescence, etc. We will illustrate the idea with the multiobjective spanning tree problem on the 
complete graph. In this problem, we have the complete undirected graph Kn on n vertices as the 
underlying graph. Each edge e has a set of profits v^^'{e) G [—1,1] for i G [d]. The set S of feasible 
solutions is given by the incidence vectors of spanning trees of Kn . Notice that the feasible set here 
lives in {0, Ij'-a) and not in {0, 1}". 



Theorem 1.2. In the d objective maximum spanning tree problem, on Kn there exists a choice of 
A-semirandom distributions such that the expected number of Pareto- optimal spanning trees is at 

l^'^sti^Y-K 

The proof of this theorem utilizes the full power of Theorem 11.11 namely the ability to choose 
different symmetric distributions. 

In our basic theorem above. Theorem I l.H we required the distributions to be symmetric, and 
therefore that theorem does not apply to the 0-1 knapsack problem where all profits and weights 
are non-negative. With a slight loss in the lower bound we also get a lower bound for this case. 
In the multiobjective 0-1 knapsack problem we have d objectives v^^' for i G [d\ called profits and 
an additional objective w called weight. Components of p'*^ and w are all chosen from [0,1]. We 
want to maximize the profits and minimize the weight, and so the definitions of domination and 
Pareto-optimality are accordingly modified. 

Theorem 1.3. For the multiobjective 0-1 knapsack problem where all the weight components are 
1 and profit components are chosen uniformly at random from [0, 1], the expected number of Pareto 
optima is i7rf(n ). 

Theorems 11.11 or 11.31 (depending on whether one wants a bound for non-negative or unrestricted 
weights and profits) can be used in a simple way as the base case of the argument with d + \ 
objectives in [5l Section 3] to give the following improved lower bound on the expected number of 
Pareto optima when the profits are i;^-semirandom (actually, uniform in carefully chosen intervals 
of length at least 1 /(/>): 

Theorem 1.4. For any fixed d > 2 (so that the constants in asymptotic notation may depend on 
d) and for n G N and (/> > 1 there exist 

1. weights wi,. . . ^Wn>Q, 

2. intervals [ajj,6jj] C [0, 1], i G [d], j G [n] of length at least 1/0 and with aij > 0, and 

3. a sets C {0, 1}" 



such that if profits v^ are chosen independently and uniformly at random in [aij,bij], then the 
expected number of Pareto- optimal solutions of the {d + 1)- dimensional knapsack problem with 
solution set S is at least 

For general niultiobjective optimization (basically without the restriction of entries being non- 
negative) the exponent of n becomes exactly d. 

The technique of [5] requires S to be chosen adversarially, and so this is the case for Theorem ll.4l 
above as well. To our knowledge, no non-trivial lower bounds were known before our work for 
natural choices of S. This is addressed by our Theorems ll.il {S = {0, 1}") and 11.21 {S is the set of 
spanning trees of the complete graph) above, though these Theorems are for a small constant value 
of (j), and therefore do not clarify what the dependence of (f) should be. 

Very recently, Brunsch and Roglin improved the induction step of their lower bound [B] . Com- 
bining their improved result with our result yields the lower bound of min{il£;(n'^~^'^(/)'^), 2®^"^}. 

2 The basic theorem 

In this section we prove Theorem 1 1.1[ We will include two proofs that, while in essence the same, 
emphasize the geometric and algebraic views, respectively. Also the second proof is more self- 
contained. It is perhaps worth mentioning that we first discovered the second proof, and in the 
course of writing the present paper we found the ideas and known results that could be combined 
to get a more geometric proof. 

2.1 First proof 

Proof. The convex hull of X is a random polytope, a zonotope actually, that is, a linear image 
of a hypercube or, equivalently, a Minkowski sum of segments. By Theorem 12.11 [3] , every vertex 
is maximal under our partial order < for at least one of the 2*^ reflections involving coordinate 
hyperplanes. That is 

|vertices of convX| < > p{X with coordinates of points flipped by signs in e) (3) 

eG{-l,l}'* 

Our symmetry assumption followed by ^ implieq^ 

Ey(p(X)) = — • y. ^ P{^ with coordinates of points flipped by signs in e) 

ee{-l,l}'' 

> — 7 • E Ivertices of convXl 
- 2°' ' ' 

It is known [9l Theorem 1.8] that for V with columns in general position (that is, any d columns 
are linearly independent, which happens almost surely in our case) the number of vertices is equal 
to the maximal number of vertices of a (i-dimensional zonotope formed as the sum of n segments 
[101 31.1.1]. That is, almost surely: 



n — 1 



d-l 

Ivertices of convXl = 2 > 

^^ \ k 
fc=o 



^This idea is from [4]. It is used there in the opposite direction, that is, to get upper bounds on the expected 
number of vertices from upper bounds on the expected number of maximal points. 



The claimed bound follows. D 

We used the following result: 

Theorem 2.1 ([1], [15\ Theorem 4.7]). Let P be a finite subset ofW^. A vertex of the convex hull 
of P is maximal under :< in at least one of the 2'^ assignments of d signs + and — to each of the 
coordinates of the points of P. 

2.2 Second proof 

Some more definitions before getting into the proof: Set M+ = {x £ M : x > 0} and ]R„ = {x £ M : 
X < 0}. For e G {—1, 1}'^, the orthant associated with e is {(eixi, . . . ,edXd) : {xi, . . . ,Xd) G M^}. 
In particular, if e is the all I's vector then we call its associated orthant the positive orthant, and 
if e is the all —I's vector then we call its orthant the negative orthant. For a finite set of points 
P = {pi, . . . ,pk} C M'^, the conic hull is denoted cone(P) = {X^j=i aipi : ai > 0} (note that the 
conic hull is always convex). 

Proof. By linearity of expectation Ep(X) = ^j. Pr[yr maximal]. Notice that Prfl/r maximal] does 
not depend on r, so we can write E,p{X) = 2"Pr[yi maximal]. 

For the rest of the proof we will focus on finding a lower bound on this last probability. To 
understand this probability we first rewrite the event [VI maximal] in terms of a different event 
via easy intermediate steps: 

[VI maximal] = [Vr ^ VI, Vr G {0, 1}"] = [0 ^ V{1 - r), Vr G {0, 1}"] = [0 ^ Vr, Vr G {0, 1}"]. 

Now we have Pr[0 ^ Vr, Vr G {0, 1}"] > Pr[0 ^ Vr, Vr G [0, 1]"]. 

Event [0 '^ Vr, Vr G [0, 1]"] is the same as the event [cone(t;i, . . . , t;„) n Ml = {0}]. That is to 
say, the cone generated by the non-negative linear combinations of vi, . . . ,Vn does not have a point 
distinct from the origin that lies in the negative orthant. 

By the separability property of convex sets (Hahn-Banach theorem) we have that there exists 
a hyperplane i7 = {x G M'^ : {u,x) = 0} separating cone(ui, . . . ,Vn) and Ml. That is, there exists 
u G M^ \ {0} such that cone(z;i, . . . , f„) • n > and this implies 

Pr[cone(7;i,...,7;„)nMl = {0}] = Pr[3n G M^ \ {0} : cone(z;i, ... ,-;;„) •«> 0]. 

Now 

Pr[cone(t)i, . . . , f„) in a halfspace] 

< 2, Pr[cone(t;i, . . . , f„) in a halfspace with inner normal in orthant e] 

= 2''Pr[3nG M^ \ {0} : cone{vi, . . . ,Vn) ■ u> 0]. 
Clearly, we have 

[cone(z;i, . . . , Vn) in a halfspace] = [vi, . . . ,Vn in a halfspace]. 



Theorem 12.21 and the fact that the distribution of Vi is centrally symmetric and assigns measure 
zero to every hyperplane through imply 



d-l 



1 / — l\ 

Pr[t;i,...,?;.„ in ahalfspace] =Pr[0 ^ conv{t'i,...,t;„}] = — Y^ f j. 

We conclude: 

d-l 



A;=0 ^ ^ 
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We used the following result by Wendel: 



Theorem 2.2 (j20j. |18l Theorem 8.2.1]). If Xi, . . . ,Xn are independent random points in W^ 
whose distributions are symmetric with respect to and such that with probability 1 all subsets of 
size d are linearly independent, then 



Pr[0 ^ convjXi, . . . , XJ] = ^ ^ (" ^ ^) 



The linear independence condition holds in particular under the simpler assumption that no 
hyperplane through the origin is assigned positive probability by any of the n distributions. For 
example, it holds when the points are i.i.d. at random from the unit sphere. 

The following easy corollary of Theorem 1 1 . 1 1 will be useful later. 

Corollary 2.3. Theorem \l.l\ holds also when the feasible set S = { — 1, 1}". 

Corollary 2.4. Under the assumptions of Theorem \1.1\ and when the set of feasible solutions is 
S ^ {0, 1}'" we have 

^vv{x)>^Y.{''l'\ (4) 

fc=o ^ ^ 

Proof. For any given r £ S, the probability that it is Pareto-optimal in the current instance with 
solution set restricted to S is at least the probability that it is Pareto-optimal in the instance with 
solution set {0, 1}". By the symmetry of the distribution this probability is independent of r and 
by Theorem 11.11 it is at least 

d-l / 1 

n — 1 



2n+d—l / ■' 
k=0 

Linearity of expectation completes the proof. D 



3 Lower bound for multiobjective maximum spanning trees 

In this section we show that our basic result can be used to derive similar lower bounds for S 
other than those encountered earlier in this paper. We illustrate this for the case of multiobjective 
maximum spanning tree problem on the complete graph; for this problem, S is the set of incidence 
vectors of all spanning trees on n vertices. The idea of the proof is simple: "Embed" the instance 
of the basic problem into the instance of the problem at hand. The proof requires the full power of 
Theorem 11.11 It is worth noting that the direct use of Cor. 12.41 does not provide any useful bound 
for the case of spanning trees. The proof below is easily modified so that all profits are chosen from 
intervals of non-negative numbers. 
We now prove Theorem 11.21 

Proof. The idea of the proof is to embed an instance of the case S = {0, 1}'"~^ of the basic theorem 
into the tree instance. We now describe our tree instance. We identify a subgraph G of Kn (the 
complete graph on n vertices): The vertex set of G is the same as the vertex set of Kn, which 
for convenience we denote by {s, t, ui,U2, . . . , u„_2}. The edge set of G consists of the edge (s, t), 
and edges {s,Uj), {t,Uj). Thus, G consists of 2n — 3 edges. Now we choose the distribution of the 
profits for each edge of Kn. For edges outside G, the distribution for all profits is identical and it 
is simply the uniform distribution on [—1,-1/2]. For edge (s,t), the distribution is uniform over 
[1/2, 1]. And for all other edges it is uniform over [—1/2, 1/2]. Let T denote the set of spanning 
trees which include edge (s,t), and for every other vertex Uj, exactly one of {s,Uj) and {t,Uj). 
Clearly |T| = 2"~^. The result of the above choices of distributions is that all the Pareto-optimal 
spanning trees come from T: 

Claim 3.1. For any choices of profits from the intervals as specified above, if a tree T is Pareto- 
optimal then T & T . 

Proof. Fix any choice of profits as above. Suppose that a tree T' is Pareto-optimal but T' ^ T. 
Then (1) either T' has an edge e outside E{G), or (2) all its edges are from E{G) but it does not 
use edge {s,t). In case (1), remove the edges from T' that are not in E{G), and then complete 
the remaining disconnected graph to a spanning tree using edges from E{G). Clearly, the resulting 
tree is heavier than T' in each of the d weights. In case (2), add edge (s,t) to T', and from the 
resulting cycle remove some edge other than (s,t). Again, the resulting tree is heavier than T' in 
each of the d weights. D 

In the rest of the proof, i will range over \d\. The i'th profit of a spanning tree T £ T, which 
we will denote by v^^'{T), can be written as follows 



V 



n-2 

«(r) = v^'Hst) + J](t;«(5,n,)x, + ^«(t,n,-)(l - xj)), 



where Xj = Xj{T) = 1 if edge {s,Uj) is in the tree and Xj = otherwise. We have 

{v(Hs,n,)x,+v^Ht,u,){l - X,)) = ^^^n^,%) + ^^^n^,^.) ^ („W(,,^^.) _ ^W(t,n,))(x, - \). 

Now, to compute the lower bound on the expected size of the Pareto set we reveal the v's in two 
steps: First we reveal (f'*^(s,Uj) + v^^'{t,Uj)) for all Uj. Then the conditional distribution of each 



{v^^'{s,Uj) — v^^'{t,Uj)) is symmetric (but can be different for different i). Thus the i'th profit 
of T G r is 7;»(r) = Eze[n~2]iv^'Hs,Uj) - v^'\t,Uj)){xj - 1/2) + ^», where ^» = z;»(s,t) + 

^ gr _2] - — ^'"^2^ — '^ ■ Since ^4'*^ is common to all trees, only the first sum in the profit matters 
in determining Pareto-optimality. Now we are in the situation dealt with by Cor. 12.31 For each fixing 
of (f(*)(s,Uj) + v^^'{t,Uj)), we get an instance of Cor. 12.31 and thus a lower bound of { 2id^i) )'^~^- 

Since this holds for each fixing of {v^^'{s,Uj) + v^'^\t,Uj)), we get that the same lower bound holds 
for the expectation without conditioning. D 

4 0-1 Knapsack 

We prove Theorem 11.31 

Proof. To show our lower bound we will use the obvious one-to-one map between our basic problem 
with d objectives and the profits of the knapsack problem: Let v^^' , ■ ■ ■ , v^ ' be an instance of our 
basic problem with all the v^ being chosen uniformly at random from [—1/2, 1/2]. Now the profits 

p are obtained from the u's in the natural way: p^ = u:- + 1/2. In general, the set of Pareto optima 
for these two problems (the basic problem instance and its corresponding knapsack instance) are 
not the same. We will focus instead on the better behaved set S C {0, 1}" of solutions having 
exactly [n/2\ ones. From Corollary 12.41 we get that, in the basic problem restricted to S, the 
expected number of Pareto optima is at least ^d{'>T''^~^'^) (using the well-known approximation 

(k2j) = 0(2VVH))- 

Now we claim that if x € 5 is Pareto-optimal in the restricted basic problem, then it is also 

Pareto-optimal in the corresponding (unrestricted) knapsack problem. Let y S {0, 1}" be different 

from X. There are two cases: If y has more than [n/2j ones, then it cannot dominate x, as y has a 

strictly higher weight (recall that all the weights are 1). If y has at most [n/2j ones, then enlarge 

this solution arbitrarily to a solution y' ^ y with exactly [n/2\ ones. The maximality of x implies 

that y' is worse in some profit, and so is y, as the profits are non-negative. D 

5 Improved lower bound in the semi-random model 

We prove Theorem 11.41 

Proof. We only describe the differences with the argument in the proof of [5l Theorem 8] . As given 
by Theorem 11.31 (but scaling the profits by l/(j), which does not change the set of Pareto-optima) , 
we start with a distribution on knapsack instances with d profits and Up objects (to be determined 
later) having unit weights and profits uniformly distributed in [0, l/<^], and expected number of 
Pareto-optima at least 0(n''~^'^). We use Uq (to be determined later) "cloning steps". Each step 
introduces d new objects while multiplying the number of Pareto-optima by at least 2'^/d. As in 
[5], objects used by the splitting step can have profits that are larger than 1, therefore they are 
split into many objects with profits distributed in [0, 1], and a suitable choice of the set S ensures 
that objects representing the splitted version of another behave as a group. 

A simple modification of the argument leading to [5l Corollary 11], using our base case with Up 
objects described in the previous paragraph instead of their base case with 1 object, implies that 
the expected number of Pareto-optima of the constructed instance is Q{np~^'^ {2'^ / d)""^) . 



Now we need to choose values of np and nq to get a bound in terms of n. By [5l Lemma 11], 
the total number of objects is at most 

We choose Uq so that the second term is no more than n/4 for n and (p sufficiently large. Such a 
choice of Uq is given by Uq = [hq\ with 

log0 
n„ 



logi^ 



<f>-d 

when 4:(f < n/4 and (p > 2d. 

Clearly there can be no more than 2" Pareto optima, and therefore there must be a point where 
increasing (p does not increase the lower bound. Say, for 

we have that the second term in ^ is no more than n/2. Finally, choosing Up = \n/2\ ensures 
that dS]) is at most n. 

As explained in the first paragraph, the expected number of Pareto-optima of the whole con- 
struction is at least 

n ("n^-i-s (^y) > n ("n^-i-s (^y) > J7(n'^-i-5</,('^-i°g'^)(i-0(i/</'))). 

When (j) violates ©, we construct the same instance as above with maximum density equal to 
the unique cp satisfying cp = ( -j-^ I {(p is about 2"'^ ). We get h'^ = n/2d and, as before, the 




expected number of Pareto-optima is at least 

/ /od\ ri/2d\ 

> 0(2®(")) 

D 
6 Discussion and Conclusion 

We proved lower bounds for the average and smoothed number of Pareto optima by introducing 
geometric arguments to this setting. Our lower bound is of the form Q{n ), ignoring the depen- 
dence on (p. The best upper bound we know, even for (p = 1, is that of Moitra and O'Donnell [T2] 
which is of the form 0{n ~'^), again ignoring the dependence on (p. Thus there is a gap between 
the upper and lower bounds. As mentioned before, the number of Pareto optima for the case when 
2" points are chosen uniformly at random from [—1, 1]"^ is ©(^(n'^"^). 

Do lower bounds similar to ours hold for any sufficiently large feasible set 5? Our techniques 
can show this for natural objectives, but require arguments tailored to the specific objective. It is 
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desirable to have a general lower bound technique that works for all sufficiently large S. Also, in 
smoothed lower bounds, to get a good dependence on (j) we need to use the technique of [5], which 
requires a very special choice of S. So, a more general question is whether we can prove lower 
bounds with strong dependence on (j) for all sufficiently large S. 

We now briefly discuss some difficulties in proving lower bounds for general S. One approach 
to this end is to show a lower bound on the expected size of the Pareto set that depends only on 
|5|, n and d. Our general technique was to first reduce the problem to lower bounding the expected 
number of vertices in the projection of the convex hull of the points in 5 to a random subspace 
of dimension d. A special distribution which is instructive to consider here, and also interesting in 
its own right, is given by the case when we project to a (i-dimensional space chosen uniformly at 
random. The expected number of vertices in the projection has been studied for the special cases of 
the simplex, the cube, and the crosspolytope (see Schneider |17|). But understanding this number 
for arbitrary 0/1-polytopes seems difficult. When the subspace to be projected to is of dimension 
n— 1, we can write the expected number of vertices in the projection as C-^^gy o,{v), where a{v) is 
the solid angle of the cone polar to the tangent cone at vertex v, and C is a constant depending on 
n. (Suitable generalizations of this formula are easy to obtain for projection to dimensions smaller 
than n — 1, but the case of dimension n — 1 is sufficient for our purpose here.) This captures the 
intuitive fact that if the polytope is very pointy at vertex v, then v is more likely to be a vertex in 
the convex hull. It is natural to ask: Given k, what is the S C {0, 1}"" with |5| = k that minimizes 
this expectation? Intuitively, the sum of angles a{v) could be minimized when the vertices are 
close together, as in a Hamming ball. Note the high-level similarity of the problem at hand to the 
edge-isoperimetric inequality for the Boolean cube. Unfortnately, our numerical experiments show 
that this is not the case: Hamming balls are not the minimizers of the expected number of vertices 
of a random projection. 
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